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TITLE OF THE INVENTION 

A METHOD OF EDITING STURCTURED DOCUMENTS 
BACKGROUND OF THE INVENTION 

1. Field of the Invention 

5 The present invention relates generally to document editing techniques. 

More specifically, the present invention relates to document editing techniques for 
extracting elements of structured documents and generates a desired output 
document 

2. Description of the Related Art 

10 is known in the art to extract necessary elements from a plurality of 

documents and deal them for generating an output document. A conventional 
editing technique for obtaining such an output document is disclosed in Japanese 
Laid-open Paten Application No. 5-259421. According to this conventional 
technique, the elements of the structured input document are extracted using 
1 5 matches of character sequences with the elements in the document, a sequence 
connector, a hierarchy connector, etc. 

However, the aforesaid conventional technique is not provided with an 
element connector and thus, it is undesirably required to use one character pattern 
for establishing a match with the element in the document and extracting the 
20 matched element. In other words, if a plurality of different elements are to be 
extracted, the same number of different character patterns are necessary. 

Further, the aforesaid conventional technique is unable to extract a plurality 
of elements while maintaining relationship of the elements extracted- This is 
because the elements extracted using different character patterns are independent 
25 with one another. 

The conventional technique in question is provided with a sequence 
connector for extracting a plurality of elements in sequence. However, the 
conventional technique is not provided with any connector via which a plurality of 
elements are extracted in parallel in any order. Therefore, it is difficult to extract a 
30 plurality of elements from input documents if the arrangement order or elements is 
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different with each document 

SUMMARY OF THE INVENTION 

It Is therefore an object of the present to provide to overcome the above- 
mentioned difficulties of the conventional editing techniques. 
5 In brief, these objects are achieved by a technique of editing a plurality of 

structured documents is disclosed. A plurality of structured documents are inputted in 
a document edit system. Thereafter, a plurality of elements are extracted from each of 
the plurality of structured documents using an element edit statement which indicates 
element to be extracted. The extraction of the elements is implemented wnile the 
1 0 relationship of the elements extracted is maintained. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The features and advantages of the present invention will become more clearly 
appreciated from the following description taken in conjunction with the accompanying 
drawings in which like elements are denoted by like reference numerals and in which: 
1 5 Figs. 1 to 9 are each diagram for describing connectors and notations used 

in the present invention: 

Fig, 10 is a diagram showing a document edit system used in a first preferred 
embodiment of the present invention; 

Fig. 11 ts a diagram showing input and output documents together with the 
20 functional block of the system of Fig. 1 0; 

Fig. 12 is a diagram showing two structured documents that respectively 
correspond to the input documents of Fig. 11; 

Fig. 13 is a diagram showing a structured document that corresponds to the 
output document of Fig. 11; 
25 Figs. 14 to 1 7 are each flow chart which includes the steps which 

characterize the operation of one example of the first embodiment; 

Fig. 18 is a diagram showing a structured document used in another 
example of the first embodiment; 

Fig. 19 shows a diagram showing a document edit system used in a second 
30 preferred embodiment of the present invention; 
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Fig. 20 (s a diagram showing input and output documents together with the 
functional block of the system of Fig. 19; 

Fig 21 is a diagram showing a structured documents that corresponds to the 
output document of Fig. 11; 
5 Figs. 22 to 28 are each flow chart which includes the steps which 

characterize the operation of the second embodiment; 

Fig. 29 shows a diagram showing a document edit system used in a third 
preferred embodiment of the present invention; 

Fig. 30 is a flow chart which includes the steps which characterize the 
10 operation of the third embodiment; and 

Figs. 31(A) to 31 (B) are each showing an output document obtained in the 
third embodiment. 

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENTS 

1 5 Preferred embodiments of the present invention will be described with 

reference to the accompanying drawings. Throughout the instant disclosure, it is 
assumed, for a convenience of description, that the document to be processed or 
edited is an SGML structured document. However, the present invention is 
applicable to the documents other than SGML ones on the condition that the 
20 document is tagged or hierarchically structured. 

Before turning to the concrete embodiments, it is preferable to define edit 
operators and notations, both used in the instant invention, 
(a) Tag: 

Tag is delimited using the '<' and characters. Tags are used to define an 
25 element which is an identified component of a document. The element usually 

consists of a start-tag, content and an end-tag. However, an element may involve 
one or more elements. The end-Tag is delimited by '<r and ">'. Element matching 
operation using the tag is briefly described with reference to Fig. 1. As shown In 
Fig. 1 , an element edit statement 1 0 includes a tag <note>. A reference numeral 
30 12 denotes a structured document The tag <note> in the element edit statement 
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10 is used to implement the element matching with one or more elements involved 
in a document portion 14 that is an document area to be searched. If the document 
portion 14 contains an element named <noie>, a match is established. 

(b) Character Pattern: 

5 The character pattern, which consists of normal text characters (such as 

typically used in UNIX commands) of a character sequence, is delimited or defined 
using the " (left double quotation mark right) and " (right double quotation mark). 
The character pattern is used to search for document data which includes at least 
one character sequence defined by the character pattern. That is, the existence of 

10 document data is ascertained if a match is established between the character 
pattern and the document data. In the instant disclosure, the document data 
signifies a document portion excluding tags. One example of matching using 
character pattern 5s described with reference to Fig. 2. As shown in Fig. 2, the 
element edit statement 1 0 includes a character pattern "the". The structured 

15 document 12 involves an element defined by <qoo> and </ooo> which contains a 
character sequence of "the" and thus, a match is established between character 
pattern "the" and the document data within the element defined by <ooo> and 
</ooo>. 

(c) Wild Card Tag <*>: 

20 The wild card tag is defined by the '*' character delimited using '<* and '>'. and 

matches each of document elements hierarchically structured. One example of 
matching of the wild card tag ■<*> with the document elements is shown in Fig. 3. 
As shown in Fig. 3, the element edit statement 10 includes the wild card teg <*>. 
The structured document 12 includes a document portion 16 to be searched for 

25 editing, which portion includes a hierarchy formed by "chapter", "section", and 

"paragraph", in the case shown in Fig. 3. the following matches are established. 

(c-1) The wild card tag matches "zero" hierarchical layer (viz., matches 
"vacant" structure) in connection with "chapter". 

(c-2) The wild card tag <*> matches the hierarchical layer (viz., element) of 

30 "section". 
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(c-3) The wild card tag <=*=> matches the hierarchical layer (viz., element) 
named "paragraph" within the element of "section". 

(d) Negation Indicator '!': 

The negation indicator is defined by the T character (exclamation character). 
5 The negation indicator is used to specify an element wherein a match is not 

established with a character sequence immediately following the negation indicator. 
Fig. 4 is a diagram schematically showing one example of matching of a character 
sequence "this" preceded by the negation indictor with an element in a document 
portion 18 of the structured document 12. The character sequence "The" is defined 
10 In the element edit statement 1 0. 

(e) Extraction Indicator '%': 

The extraction Indicator is defined using the '%' character, and is used to 
extract an element from the structured document, which element matches an 
element designated by a tag which follows the extraction indicator '%'. One 

15 example of the usage of the extraction indicator is described with reference to 
Fig. 5. As shown in Fig. 5, the element edit statement 10 contains a tag <note> 
which is preceded by the extraction indicator The structured document 12 

involves the document potion 14 that is currently to be searched. In this case, the 
element including tags <note> and </note> is specified and then extracted. 

20 (f) Sequence Connector 

The sequence connector is defined by the ',' (comma) character. 
Condiering the example of <A>,<B>, in the case of which the element specified by 
tag A must precede the element specified by tag B. However, a match is also 
established even if another element exists between the elements respectively 

25 defined tags A and B. One example of the sequence connector ',' is shown in Fig. 
6. As shown, the element edit statement 10 contains the sequence connector ',' 
between tags <beginning^» and <ending>. The structured document 12 involves a 
document portion 20 which contains the elements "beginning" and "ending" in this 
order. As a result, matches are established as illustrated. 

30 (g) Hierarchy Connector: 
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The hierarchy connector is defined by inserting no character between 
adjacent tags. A match is established only if the element (defined by the first tag 
and denoted by C) preceding the hierarchy connector contains the element (defined 
by the second tag and denoted by D) following the hierarchy connector in the 
5 document portion to be searched. It is to be noted that a match is also established 
even if there exists another element between the elements C and D. One example 
of usage of the hierarchy connector is described with reference to Fig. 7. As 
shown in Fig. 7, the element edit statement 10 contains the hierarchy connector 
between two tags <section> and <paragraph>. The structured document 12 
1 0 involves a document portion 1 6 which is currently to be searched for and contains 
the hierarchical data structures <chapter>, <section> and <paragraph>. in this 
case, the element <section> contains the element <paragraph> and thus, a 
successful match is established as illustrated. 

(h) Parentheses: 

15 The parentheses '<' and ')' indicate that the element(s) within the 

parentheses is preferentially processed. 

(i) AND Connector 

The AND connector is defined using the character. The example '(E & F) 
indicates that 'F' may either follow or precede 'E' in the document portion to be 
20 searched for editing. Referring to Fig. 8, one example of usage of the AND 

connector is shown. As shown, the element edit statement 10 contains the AND 
connector sandwiched by two tags <beginning> and <ending>. The structured 
document 12 contains the elements, defined by the tags -<beginning>* and <ending:>, 
In the document portion 20 and thus, a successful match is established as 
25 illustrated. 

(j) OR Connector ' |* : 

The OR connector Is defined by the* I' character. The example (G I H) 
indicates that "G" may be present or "H" may be present in the document portion to 
be searched for editing. The OR connector ( | ) is further described with reference 
30 to Fig. 9. As shown, the element edit statement 1 0 contains the OR connector 
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sandwiched by two tags <beginning> and <ending>. The structured document 1 2. 
contains onfy the element <beginnjng> in the document portion 22 and thus, one 
successful match is established as illustrated. 

Referring to Fig. 10, there is shown a structured document edit system 30 of 
5 a first embodiment of the present invention in block diagram form. 

As shown m Fig. 10, a controller 32, which typically takes the form of central 
processing unit (CPU), ia provided to control the overall operation of the system 30. 
A memory 34 is arranged so as to receive a plurality of documents, on a one-by-one 
basis (usually), from a data base 36 which has stored a plurality of structured 
10 documents. A memory 33 is provided for storing one or more element edit 

statements. Typically, one statement is applied to the memory 38 via a suitable 
input interface 40. 

A document edit engine 42 accesses the memories 34 and 38, and edits the 
document stored in the memory 34 using the element edit statement, and applies 
15 the edit results (in sequence or in batch) to a memory 44 to store the same therein. 
The edit result is adaptlvely outputted using a document retriever 46 and an 
appropriate output interface 48. In this case, the edit result may be arranged using 
an edit result arranging data applied to the edited document retriever 46. 

A first example of the first embodiment will be described with reference to 
20 Figs. 11-17. 

Fig. 1 1 is a diagram which shows a plurality of input documents (only two 
documents SO and 52 are shown), an output document 54, and the document edit 
system 30 (Fig. 10). The output document 54 is a list showing the edit result after 
being subject to listing order arrangement . The first example of the first 
25 embodiment is to edit a plurality of input documents, including the input documents 
50 and 52, so as to, in this particular case, extract the names of paper's authors and 
corresponding paper's titles, after which the edit (extracting) results are listed. 
As shown, the input document 50 contains the following items. 
# Summary of paper 
30 # Name of Society: ABC Meeting 
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# Title: DEF Report 

# ; : Name: Tare SATO 

ff : Name: Hanako SUZUKI 

# . Abstract: Until recently. 

5 On the other hand, the input document 52 contains the following items, 

ff Summary of paper 

# • Name of Society: GHi Meeting 

# ' Title: Analysis of JKL 

# Name: Jiro NAKAMURAO 
10 # ' Abstract: This report, ... 

The output document 54 contains the fallowing items which are results of 
document editing; carried out by the edit system 30. 

# List A 

# Name Title 

1 5 Taro SATO DEF report 

; Hanako SUZUKI DEF report 
Jiro NAKAMURA Analysis of JKL 

Fig. 12 shows the structured input documents SO' and 52' that respectively 
20 correspond to the input documents 50 and 52, while Fig. 1 3 shows the structured 
document 54.' that corresponding to the output document 54. It is understood that 
the content of each of the items ("SUMMARY OF PAPER", "NAME OF SOCIETY", 
etc.) is defined using a start-tag defined by '<* and '>' and an end-tag defined by "</* 
and '>". The detailed descriptions of Figs. 12 and 13 are deemed redundant and 
25 accordingly, will be omitted for brevity. 

The operation of editing The input documents 50 and 52 and generating the 
output document 54, will be described using Figs, 14-17 together with Fig. 10-13. 

Fig. 14 is a flow chart which shows the steps which characterize the overall 
operation or the document data editing according to the first embodiment. In Fig. 
30 14. at step 60, the first input document 50 is appfied to the memory 34 form the 
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database 36. Assuming that an element edit statement is: 

%<TitJe>.^><name> ... element edit statement A 

which is referred to as the element edft statement A for the sake of simplifying the 
discussion. At step 62, the element edit statement A is written into the document 
5 edii engine A2, aner wnich the routine proceeds to a sub-routine 64 for element 
extraction. The sub-routine 64 is shown in Fig. 1 5 in detail. 

Referring to Fig. 15, at step 66, if the element edit statement, retrieved into 
the document edrt engine 42, contains one or more than one "OR connectors", the 
element edit statement is divided into a plurality of element edit instructions 

10 (operators), such as %<titte> and %<narne>, every "OR connector". However, in 
the instant case, there exists no "OR connector in the element edit statement A and 
Thus, the step 66 is not executed. At step 68, one of the element edit instructions 
divided at step 66 is selected. However, as mentioned above, no division of the 
statement is carried out at step 66 and thus, the element edit statement A is 

1 5 selected as a whole, after which the routine goes to a sub-routine 70, the details of 
wnich is shown in Fig. 16. 

Referring to Fig. 16, at step 72, the element edit statement containing "AND 
connector (&)" is processed. Although no "AND connector" is contained in the 
element edit statement A, the flow chart of Fig. 1 6 is described in that another 

20 example, set form later, includes "AND connector. At step 72, if the element edit 
statement, applied to the document edit engine 42 (Fig. 10), contains one or more 
"AND connectors", the statement is divided into a plurality of element edit 
instructions or tags every each "AND connector. However, in the instant case, no 
"AND connector is contained In the element edit statement A and thus, the step 72 

25 is not executed. At step 74, one of the element edit Instructions or tags, divided at 
step 72 is selected. However, as mentioned above, no division of the statement is 
carried out at step 72 and thus, the element edit statement A is selected as a whole, 

after which the program goes to a sub-routine 76, the details of which is shown in 

Fig. 17. 

30 In Fig. 17, at step 78, the first element edit instruction (or tag) of the element 
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edit statement A is selected for execution. Following this, the program proceeds to 
step 80 at which a check is made to determine rf the edit instruction (or tag) selected 
at step 78 (viz., <tit!e> in the instant case) is parenthesized. The element edit tag 
<tit!e> is not parenthesized and thus, the routine goes to step 82 at which the tag 
5 *=true> is processed to determine if the element "title" matches any element of the 
structured document 50'. In the instant case, a match is established with the 
following element in the structured document 50': 
<rme> 
DEF Report 
10 </title> 

Subsequently, at step 84, a check is made to determine if the matching execution 
has completed. : In this case, since the answer to the inquiry made at step 84 is 
negative, the routine proceeds to step 56. The element edit statement contains the 
sequence connector ',' and hence, at step 86, the document portion following the 

1 5 element which has matched at step S2, becomes the next document portion to be 
searched. Following this, at step 88, the next element edit instruction (tag) (viz., 
<name>) is selected and the routine goes back to step 80. Thus, the following 
elements in the structured document 50' are matched: 
<name> : 

20 Taro SATO 

</name> 

and 

<name? 
Hanako SUZUKI 
25 </name> ; 

Since the matching execution in connection with the document 50" completed, the 
answer to the inquiry made at step 84 is positive. Thus, the routine goes to step 90 
at which a check is made to determine if any match has been established. In this 
case, the matches are established as mentioned above, the routine proceeds to 
30 step 92 at which a check is further mad© to determine if the extraction indicator '%' 
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exists the statement A, At step 94, since the extraction indicator '%' is involved in 
the element edit ktatement A, the matched elements are extracted and stored in the 
memory 44. Thereafter, the routine goes to step 96 of Fig. 16. In the above, if the 
answer at step 80 Is positive, the parentheses are deleted at step 81 and the routine 
5 jumps to the sub-routine 64 (Fig. 1 5), 

After executing step 94 (Fig. 17), the routine goes to step 96 at which a check 
is made to determine if all the element edit instructions (or tags), divided at step 72, 
have been processed. In the instant example, the element edit statement A does 
not involve any AND connector and hence, steps 96, 98, and 100 are not executed 

1 0 and the routine goes to Step 1 02 (Fig. 15). In the instant example, the element edit 
statement A does not involves any OR connector and hence steps 1 02, 1 04 and 1 06 
(Fig. 15) are not executed. 

Subsequently, the routine goes to 108 (Fig. 14) at which a check is made to 
determine if the element edit statement to be processed remains. Since no 

15 element edit statement to be processed with respect to the document 50' remains, 
the routine goes to step 110. The document 110 has not yet been processed and 
hence, the routine goes back to step 60 at which the next document 52' is inputted 
to memory 34, after which the above mentioned processing is carried out in the 
same manner. The processing of the document 52' is clear from the foregoing, the 

20 description thereof will be omitted for brevity. It is understood that the element 
extracted from the document 52' is: 
^title> 

ANALYSIS OF JKL 
</title> 

25 and 

<name> 

Jiro NAKAMURA 
</name> 

All the elements thus extracted are stored in the memory 44 as the edit result. 
30 Thereafter, the edit result is retrieved by the edited document retriever 46, which 
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arranges and add data (or name(s)) if the edit result arranging data is applied 
thereto. In the instant example, the name "LIST A" is added to the top and, the 
names of the authors are listed at the left side of the list Subsequently, the 
document named "LIST A" is outputted from the system 30. The above operations 
5 are implemented: at step 112. 

A second example of the first embodiment will be described. This example 
extracts the items of "name of society" and "title" from the documents 50 and 52, 
and outputs the edited document as a list B which is shown below. 
LISt B 

10 NAME OF SOCIETY TITLE 

ABC MEETING EDF REPORT 

CHli MEETING ANALYSIS OF JKL 

In this example, the element edit statement is: 

%<society>&%<titfe> . . . element edit statement B 

1 5 As shown, the above statement is referred to as "element edit statement B" for the 
sake of convenience of description. This example features that the elements 
redefined by the tegs in the statement B are indenpently matched and then 
extracted. Consequently, even if the items in the input documents 50 and 52 are 
differently arranged, the element extraction can be implemented without difficulty. 

20 In the second example, the tags <society> and <title> are divided at step 72 

(Fig. 16) and then! the matching of the elements, defined by the tags <society> and 
<title>, in each of the documents 50' and 52' are able to be carried out 
independently. The extracted Items (elements) are arranged, using the edit result 
arranging data applied to the edited document retriever 46 (Fig. 10), so as to meet 

25 the requirement shown above. Further, the name "LIST B" can be added to the 
output document using the aforesaid edit result arranging data. 

A third example of the first embodiment will be described, which includes OR 
connector. An input document, indicated as a structured document, is shown in 
Fig. 18 and denoted by numeral 120. As shown in Fig. 18, an outermost element is 

30 a paper element which contains a first-p (first paragraph) element and a second-p 



Rscai vsd 07-21-99 03:31pm From-35462068 



To-FOLEY AND LARDNER Paga 13 



•93 07/22 04:39 "035482068 



MORISAKI 



(11014 



-13- NE-840 

(second paragraph). Further, the first-p element contains a figure element, and 
similarly, the secpnd-p element contains a figure element. The third example is to 
extract the figure element from each of the first-p and second-p. Thus, the element 
edit statement in;the third example is: 
5 (<first-p> |; <second-p>)%<figure> ...element edit statement C 

As shown, this statement is referred to as "element edit statement C" for the sake of 
convenience of description. It is to be noted that (<tfirst-p> I -=second-p>) 
and %<fjgure> are connected by the hierarchy connector since no character is 
provided therebetween. The third example contains OR connector and thus, the 

1 □ tags within the parentheses are divided at step 66 (Fig. 1 5), after which these tags 
are separated and are subject to the element matching process on an element-by- 
elemcnt basis. As mentioned above, the hierarchy connection is used, the element 
<figure^- of each of the elements -^first-p>- and <secand-p? is extracted. 

A second embodiment of the present invention will be descirbed with 

15 reference to Figs: 19-28. 

Referring to Fig. 19, there is shown a structured docment edit system 
denoted by 30* which, in addition to the arrangement shown in Fig. 10, comprises 
the following new blocks. 

(1) block 130: document edit engine 2. 

20 (2) block 1 32: memory for storing an edited document. 

(3) block 134: document content changing means. 

(4) block 1 36: sort engine. 

(5) block 1 38: memory for storing one or more initializing instructions. 

(6) block 140: memory for storing one or more pre-edit instructioins. 
25 (7) block 142: memory for storing one or more post-edit instructions. 

(S) block 144: memory for storing one or more editted document arranging 
instruction(5). 

In order to differentiate the block 42 from the block 130, the block 42 is 
indicated by the document edit engine 1 . 
30 Fig. 20 shows two input documents 146 and 148, the edit system 30' of Fig. 
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19. and an output document 150. The input documents 146 and 148 are 
respectively identical with the documents 50 and 52 of Fig. 1 1 , and thus, the further 
descriptions thereof will be deemed redundant and accordingly omittted for 
simplifying the inkant disclosure. The ouput document 1 50 is a list C which 
5 includes the title pf the paper and the number of authors, which are extracted from 
the documents 146 and 14S. Further, the total number of the authors are 
calculated and listed. Still further, the name "title" in each of the input documents 
146 and 148 is changed to "Title of Paper" in the list C, while the names "Number of 
Authors" and "Total Number" are added in the list C. 
10 Fjg, 21 shows a structured out document 150" which corresponds to the 

output document; 150 of Fig. 20. 

In order to: generate the list C shown in Fig. 20, element edit statements D-1 

and D-2 are Inputted to the memory 38. More specifically, 
The element edit statement D-1 contains: 
15 (1) Edit statement 1 which is %<tit!e> for extracting the element <title=»; 

(2) Edit statement 2 for changing the name Trtle" to "TTtel of Paper" and 
storing the changed name in a variable "title". 

The element edit statement D-2 contains: 

(1) Edit statement 3 which is %<name> for counting the number of authors; 

20 (2) Edit statement 4 which counts the number of extracted names and stores 

the counted number in a variable "count". 

The operation of the second embodiment will be described with reference to 
Figs. 19-28. In Fig. 22, an operation starts with a sub-routine 200 the detail of 
which is shown in Fig. 23. At step 200 (viz., sub-routine 200), the initializing 

25 instructions, which are retrieved from the memory 1 36, includes two instructions, 
one of which is to reset a variable "total" which stores the total number of the 
authors and the other of which is to reset a variable "table" provided for arranging 
the edit result andigenergting the list C. At step 202, one input document is applied 
to the system 30', after which a pre-edit operation is implemented at step 204 (viz., 

30 sub-routine 204). : Following this, a plurality of operatloins for generating the output 
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document 150* are carried out at the following steps as shown in Figs. 22-28, 

A third embodiment of the present invention will be described with reference 
to Figs. 29, 30, and 31 (A)-31 (C). The structured document edit system 30" of Fig. 
29 differs from ttiat of Fig. 19 in that the former arrangement further comprises an 
5 input document detection request receiver 298. Fig. 30 is a flow chart which shows 
the steps which characterizes the operation of the third embodiment On the other 
hand, each of Figs. 31 (A)-31 (C) shows an output document generated from the 
system 30". 

It will be understood that the above disclosure is representative of only three 
1 0 possible embodiments of the present invention and that the concept on which the 
invention is basedi is not specifically limited thereto. 
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What is claimed ;is: 

1. A method of editing a plurality of structures documents, comprising the steps 

of. 

(a) acquiring a plurality of structured documents in a document edit system; 

<fa) extracting a plurality of dements in each of said plurality of structured 
documents using an element edit statement which indicates element to be extracted, 
wherein the elements are extracted while relationship of the elements extracted is 
maintained. 
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ABSTRACT OF THE DISCLOSURE 

A technique of editing a plurality of structured documents is disclosed. A 
plurality of structured documents are inputted in a document edit system. Thereafter, a 
S plurality of elements are extracted from each of the plurality of structured documents 
using an element edit statement which indicates element to be extracted. The 
extraction of the elements is implemented while the relationship of the elements 
extracted is maintained. 

10 
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FIG. 13 
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FIG. 19 
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< elcment> 
<title of paper> 
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</tit!e of paper> 
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</element> 
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</ elements 
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10 

</total number> 



</list o- 
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My residence, post office address, and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint inventor (if plural names are 
listed below) of the subject matter which is claimed and for which a patent is sought on the invention entitled: 

A METHOD OF EDITING STRUCTURED DOCUMENTS 

the specification of which is attached hereto unless the following box is checked: 

E£l was filed on Jul\ 21. 1999 as United States Application Number or PCT International Application Number and 

was amended on (if applicable). 

I hereby state that I have reviewed and understand the contents of the above-identified specification, including the claims, as amended by any 
amendment referred to above. 

I acknowledge the duty to disclose information which is known by me to be material to patentability as defined in Title 37, Code of Federal 
Regulations § 1 .56. 

I hereby claim foreign priority benefits under Title 35, United States Code, § 119(a)-(d) or § 365(b) of any foreign application(s) for patent 
or inventor's certificate, or § 365(a) of any PCT International application which designated at least one country other than the United States, 
listed below and have also identified below any foreign application for patent or inventor's certificate, or PCT International application having 
a filing date before that of the application on which priority is claimed: 

PRIOR FOREIGN APPLICATION(S) 



DAY/MONTH/YEAR FILED 



PRIORITY 
CLAIMED 



8-348586 



JAPAN 



26 December 1996 



No 



H hereby claim the benefit under Title 35, United States Code § 119(e) of any United States provisional application(s) listed below. 



I APPLICATION NO. 


FILING DATE 















I hereby claim the benefit under Title 35, United States Code, § 120 of any United States application(s), or § 365(c) of any PCT International 
application designating the United States, listed below and, insofar as the subject matter of each of the claims of this application is not disclosed 
in the prior United States or PCT International application in the manner provided by the first paragraph of Title 35, United States Code, § 
1 12 I acknowledge the duty to disclose information which is known by me to be material to patentability as defined in _1 ltle 37, Code ot federal 
Regulations § 1.56 which became available between the filing date of the prior application and the national or PCT International riling date 
of this application: 



APPLICATION SERIAL NO. 



FILING DATE 



STATUS: PATENTED, PENDING, 
ABANDONED 



I hereby appoint as my attorneys, with full powers of substitution and revocation, to prosecute this application and transact all business m the 
Patent and Trademark Office connected therewith: Stephen A. Bent, Reg. No. 29,768; David A. Blumenthal, Reg. No. 26,257; William T. 
Ellis Ree. No 26 874" John J. Feldhaus, Reg. No. 28,822; Patricia D. Granados, Res. No. 33,683; John P. Isacson, Reg. No 33,715; 
Michael D. Kaminski, Reg. No. 32,904; Kenneth E. Krosin, Reg. No. 25,735; Glenn Law, Reg. No. 34,371; Eugene M. Lee, Reg. No. 
32 039- Richard Linn Reg. No.25,144; Peter G. Mack, Reg. No. 26,001; Brian J. McNamara, Reg. No. 32,789; Sybil Meloy Reg No. 
22'749 : Richard C Peet Reg. No. 35,792; George E. Quillin, Reg. No. 32,792; Colin G. Sandercock, Reg. No. 31,298; BernhardD. Saxe 
Reg. No. 28,665- Charles F. Schill. Reg. No. 27,590; Richard L. Schwaab, Reg. No. 25.479; Arthur Schwartz, Reg. No. 22.115; Harold 
C. Wegner, Reg.' No. 25,258. 

Address all correspondence to FOLEY & LARDNER, 3000 K Street, N.W., Suite 500, P.O. Box 25696, Washington, DC 20007-8696. 
Address telephone communications to David A. Blumenthal at (202) 672-5300. 
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I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are 
believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such willful false statements 
may jeopardize the validity of the application or any patent issued thereon. 



Full Name of First or Sole Inventor 

Motohide OTSUBO 


Signature of First or Sole Inventor 


Date 


Residence Address 
Tokyo, Japan 


Country of Citizenship 
Japan 


Post Office Address 

do NEC Corporation, 7-1, Shiba 5-chome, Minato-ku, Tokyo, Japan 




Full Name of Second Inventor 


Signature of Second Inventor 


Date 


Residence Address 


Country of Citizenship 


Post Office Address 






; ;:Full Name of Third Inventor 


Signature of Third Inventor 


Date 


4 ; Residence Address 


Country of Citizenship 


% Post Office Address 




' I Full Name of Fourth Inventor 


Signature of Fourth Inventor 


Date 


Residence Address 


Country of Citizenship 


Post Office Address 






Full Name of Fifth Inventor 


Signature of Fifth Inventor 


Date 


Residence Address 


Country ot Citizenship 


Post Office Address 



